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A method of assigning objects to processing units 



Description 



Field of the invention 

5 The present invention relates to the field of data processing, and more particu- 
larly without limitation, to object size balancing in a multi-computing environ- 
ment 
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Background and prior art 

Various multi-computing architectures are known from tlie prior art where a plu- 
rality of processing units is coupled to form a cluster. Such architectures are 
5 used in parallel processing and also In the emerging field of blade computing. 

Blade computing relies on blade servers, which are modular, single-board com- 
puters. An oven/lew of blade computing is given in "Architectures and Infrastruc- 
ture for Blade Computing", September 2002, Sun microsystems and "THE 
lU NEXT WAVE: BIADE SERVER COMPUTING". Sun Microsystems 
(www.sunxom/servers/entrvAalade^ . 

A content toad balancing blade Is commercially available from Sun microsys- 
tems ("Sun Fire TM 81 On). This blade provides traffic and content management 
15 functionalities. Content load balancing is achieved based on URLs, CGI scripts 
and cookies; server' load balancing is achieved based on server loads, re- 
sponse times, and weighted round-robin algorithms. 

US patent application no. 20030105903 shows a web edge server, which com- 
20 prises a number of blade servers. A switch and an information distribution mod- 
ule are provided for the purpose of balancing. The information distribution mod- 
ule receives an information message, performs processing on the message to 
determine a destination, and forwards a message .toward the determined 
destination via an internal communications network. 

2J : 

• Summary of the invention 

The present Invention provides for a method of assigning objects to processing 
units of a cluster of processing units. Each one of the processing units has a 

30 certain storage capacity. For the purpose of balancing the sizes of objects of the 
individual processing units, a given number of objects needs to be distributed. 

This is accomplished by sorting of the objects by size, which provides a se- 
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quence of objects. This sequence is used for assigning of objects to processing 
units. 

The procedure for assigning of objects to a processing unit, starts with, the larg- 
5, est object of the sequence and continues until thej'emaining storage capacity of 
the processing unit is below the size of the smallest remaining object of the se- 
quence. When this condition is fulfilled, the procedure is carried out again for 
the next processing unit, whereby the objects which have been previously as- 
signed, are deleted from the sequence. This way a minimum number of proc- 
10 essing units, which are required for handling a given set of objects can be de- 
termined. 

In accordance with a preferred embodiment of the invention the remaining stor- 
age capacity of a processing unit is determined by tlie difference between the 
15 storage capacity of the unit and the aggregated size of objects, which have 
been assigned to the processing unit. On the basis of this definition of the re- 
maining storage capacity, the minimum number of processing units is deter- 
mined. 

20 In accordance with a further preferred embodiment of the invention, the object 
• size balancing procedure is performed again in order to further improve the ob- 
ject size balancing. For this purpose the largest gap between the aggregated 
sizes of objects being assigned to one of the processing units and the maximum 
storage capacity Is determined. 

25 

This gap is divided by the minimum number of processing units and the result of 
the division is subtracted from the maximum storage capacity to provide a 
threshold level. When the procedure for assigning the objects to the processing 
units is performed again, the definition, of the remaining storage capacity is the 
30 .difference between the aggregated size of the objects being assigned to the 
processing unit and the threshold level. As a result of the renewed performance 
of the assignment procedure, the gap can be substantially reduced. 
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In accordance with a further preferred embodiment of the invention, the theo- 
retical storage capacity limit for a perfectly evenly distributed load is used as a 
threshold. This threshold is obtained by calculating the difference between the 
5 total of the storage capacities of the processing units and the total of the sizes 
of the objects and dividing the difference by the minimum number of processing 
units. The result of the division is subtracted from the storage capacity, which 
provides the theoretical limit. 

10 The assignment procedure is performed again, whereby the remaining storage 
capacity is defined as the difference between the aggregated size of the objects 
of a processing unit and the threshold. Typically the storage capacity of the last 
processing unit of the minimum number of processing units, to which the ob- 
jects are assigned in the procedure, will not be sufficient to accommodate ail of 

1 5 the remaining objects of the sequence. 

In this case one ore more Iterations are performed. For one iteration the excess 
amount of memory is divided by the minimum number of processing units. The 
result of the division is added to the threshold and the assignment procedure is 
20 performed again. This process continues until the storage capacity of the last 
processing unit, to which the remaining objects of the sequence are assigned in 
the procedure, is sufficient to accommodate all these objects. This way the ob- 
ject size balancing Is further improved. 

25 — In accordaojcajMilbjaJUil^^ 

old for performing the assignment procedure is varied between the theoretical 
limit and the storage capacity. For each value of the threshold, a new assign* 
ment procedure is performed. For each of the assignments of objects to proc- 
essing units, a statistical measure is calculated. This statistical measure is a 

30 basis to select one of the assignments for optimal object size balancing. 
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In accordance with a further preferred embodiment of the invention the standard 
deviation or variance of the sum of the object sizes assigned to a processing 
unit is used as a statistical measure. The standard deviations obtained for the 
processing units as a result of the assignment procedure are stored as an over- 
5 all quality measure of the assignment. The assignment having the lowest overall 
quality measure Is selected. 

In accordance with a further preferred embodiment of the invention, each one of 
the processing units is a blade or a blade server. One of the blades can have a 
10 program, which implements the principles of the present invention, in order to 
perform object size balancing. This way the number of swap-operations be- 
tween the blades can be minimized. 

In accordance with a further preferred embodiment of the invention the princi- 
15 pies of the invention are implemented in an application program mnning on a 
personal computer. The application program Is provided with a list of objects 
and the estimated sizes of the objects, which needs to be handled by the cluster 
of processing units. On the basis of the object sizes, the minimum number of 
processing units which are required for the processing can be detemnined. This 
20 information can form the basis for a conresponding investment decision of a 
customer. 

It is to be noted that the present invention is not restricted to a particular type of 
objects. For example, data objects such as tables, arrays, lists, and trees are 
25 distributed to processing units, e.g. blades, in accordance with the principles of 
the present Invention. For example, each one of the processing units runs a 
data processing task to which the respective objects are assigned. 

Brief description of the drawings 

3U 

In the following, prefenred embodiments of the invention will be described in 
greater detail by making reference to the drawings in which: 
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ngure 1 is a schematic block diagram of a modular computer system, having 
a cluster of blades, 

-5 Figure 2 Is Illustrative of a flow diagram for assigning of objects to blades and 
for determining the minimum number of blades, 

Figure 3 is an example for tables, which need to be assigned to blades, 

10 Figure 4 shows the result of a sorting operation, 

Figure 5 shows a first step of assigning a table to a first one of the blades, 

Figure 6 shows a second step for assigning a table to the first blade, 

15 

Figure 7 shows the first assignment of a table to a second blade, 

Figure 8 shows a second assignment of a table to the second blade, 

20 Figure 9 shows the assignment of three further tables to the second blade, 

Figure 10 shows the resulting assignment of tables to blades as a result of the 
assignment procedure, 

^gu r e 1 1 I s i »ustr3t i ve-of-a-prefeiyed-embodiF>ent-Qf-the-tft^^ — 



procedure of figure 2 is performed again with a lower threshold, 

Rgure 12 is illustrative of the lower threshold, 

30 Figure 1 3 is illustrative of the result of the .renewed performance of the proce- 
dure of figure 2 with the lower threshold, 
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Rgure 14 Is illustrative of a preferred embodiment of a method of the invention 
where the threshold is varied iteratively, 

Figure 15 is illustrative of the starting point of the iteration. 
Figure 16 shows the result of the first iteration, 

Figure 17 shows the resulting assignment of objects to the minimum number of 
blades after completion of the procedure of figure 14. 

Figure 18 is illustrative of a further prefenred embodiment of the invention, 
where the threshold is varied in predetermined steps. 

Figure 19 is illustrative of the discrete continuum in which the threshold is var- 
* 5 led and the result of the assignment procedure. 

Figure 20 is illustrative of a computer system performing the assignment of ob- 
jects to blades. 
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Detailed description 



, Figure 1 shows cluster 100 of blades Bi. B2. 83 Bn. Each one of the blades 

has processor 102 and memory 104. In the example considered here, all 
25 memories 104 have the same storage capacity. The blades are coupled by a 
network 106, such as a bus system. The number N of blades of cluster 100 
needs to be chosen, such that a given number of M objects of varying sizes can 
be handled. 

30 For example, cluster 100 implements a so called search engine. In this instance 
Identical search processors mn on each one of the blades. The assignment of 
data objects, such as index tables, to blades can be stored in a dispatcher unit 
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(not shown in the drawing) of cluster 100. This way data objects are assigned to 
blades and data processing tasks running on the blades. 

Figure 2 shows the corresponding procedure for assigning the objects to blades 
.5 and thereby determine the minimum value for N. 

In step 200 a sorting operation Is performed in order to sort the M objects by 
size. The corresponding object sequence is provided in step 202. In step 204 
the index i for the blades is Initialised to one. 

10 * 

In step 206 processing of the object sequence starts in the order starting- vfiith 
the largest object of the sequence. The first object of the sequence, which by 
♦definition is the largest object of the sequence, is assigned to blade Bi in step 
206. In step 208 the fir^t object which has been assigned to blade Bi is deleted 
15 from the sequence. 

In step 210 the size of the objects, which have been already assigned, to blade 
Bi is added up and a gap G between the aggregated object size and a thresh- 
old is calculated. When the assignment procedure of Fig. 2 is earned out for the 
20 first time, the threshold is the storage capacity of one of the blades. 

• In step 212 it is determined whether there remains an object In the sequence, 
which fits into the gap G. If this is the case, the largest of these objects is as- 
signed to the blade Bi in step 214 and deleted from the sequence before the 
25 cpntrol goes bqc^tO-Step-Zm 

If there Is no such object which fits into the gap G, step 218 is carried but. In 
step 218 it is determined whether all objects have already been assigned to 
blades. In other words, in step 218 it is checked whether the sequence is 
30 empty. If this is not the case the index i Is incremented in step 220 and the con- 
trol goes back to step 206 to assign remaining objects of the sequence of the 
next blade 82. 
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If the cx>ntrary is the case,* the index i equals the minimum number N of blades 
which are required to handle the M objects. This number is outputted in step 
222. The minimum number N of blades can be a basis for an investment deci- 
,i sion for purchasing of a conesponding number of blades. Further, the assign- 
ment of objects to blades is outputted in step 224 In order to visualize the qual- 
ity of the object size balancing. 

Figure 3 shoNA/s an example. In the example considered here the objects are a 
10 number of twenty different tables having various sizes between 50 MB and 
3566 MB as indicated In figure 3. For example, table 1 has a size of 3250 MB. 
table 2 has 250 MB. table 3 has 750 MB, etc. The table sizes can be actual ta- 
ble sizes or average table sizes vwhich have been obtained by monitoring a real 
life data processing system. Alternatively the table sizes are estimates for the 
1 5 purpose of planning cluster 1 00. 

Figure 4 shows the result of the sorting operation performed on the tables 1 to 
20 of figure 3 (cf. step 202 of figure 2). 

21) Figure 5 illustrates the assignment of the first object of the sequence, i.e. the 
largest table 20 to blade Bi. In the example considered here, each blade has a 
storage capacity of 4 GB = 4096 MB of main memory. Table 20 has a size of 
2566 MB, which leaves a gap G of 530 MB of remaining storage capacity (cf. 
step 210 of figure 2). 

25 

Next it is determined whether there is a next object in the sequence which fits 
into the gap G. Table 12, which has a size of 520 MB Is the largest table which 
fits Into the gap G. This table 12 is thus also assigned to blade 1. The aggre- 
gated size of the objects assigned to blade 1, i.e. table 20 and table 12. is 4068 
30 MB, which leaves a gap G of 10 MB. This gap G of 10 MB is too small to ac- 
commodate even the smallest remaining object of the sequence of tables. 



I 

i 
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As there remain tables in the sequence which have not yet been assigned to a 
blade Ihe index I is incremented and the assignment procedure goes to the next 
blade Ba (of. steps 218 and 220 of figure 2). With respect to blade Bz the above- 
explained procedure is canned out again on the basis of the unassigned tables, 
5 which remain in the sequence. 

This way the largest remaining table of the sequence, i.e. table 15, is assigned 
to blade B2 which leaves a gap G of 596 MB, The gap G is filled with tables 6, 2, 
13 and 14 as Illustrated in figures 7 and 8. The resulting assignment of tables to 
ici blade 82 is shown in figure 9. ' ' ' 

The aggregated size of the tables, which have been assigned to blade 82, i.e. 
tables 15, 6, 2. 13 and 14, leave a gap G of 76 MB which is not enough to ac- 
commodate the smallest unassigned table, i.e. table 11. of the sequence. Thus. 
15 the index I is incremented and the assignment procedure is continued for the 
next blade B3. This process goes on until all tables of the sequence have been 
assigned to one blade Bj! The result of the assignments of tables to blades is 
illustrated in figure 1 0. 

20 In addition to the assignment of tables to blades this way the minimum number 
N of blades, which are required for handling of the given number of tables (cf. 
figure 3), is obtained. In the example considered here, the resulting assignment 
of tables to the IM = 8 biddes leaves a gap G of 2196 MB on blade 8. In order to 
further improve the object size balancing the method of figure 11 Is carried out 

25 

in step 1100 the largest gap G is determined. In the example shown in figure 
10. this is the gap G of b>lade Bb. The other blades Bi to B7 have smaller gaps 
between the aggregated size of the tables assigned to the corresponding blade 
and the storage capacity of 4 GB. 

30 

In step 1102 the gap G detemiined In step 1100 is divided by the number N of 
blades. In the example of figure 10, this means that G = 2196 MB is divided by 
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N = 8 in order to obtain the value of Delta 1 = 275 MB. In step 1 104 a threshold 
Is calculated by subtracting Delta 1 from the storage capacity, i.e. threshold = 
4096 MB - 275 MB = 3821 MB. 

5 With the threshold calculated in step 1 104 the methpd of figure 2 Is performed 
again in step 1106. The resulting assignment of the objects to the blades is 
more evenly distributed due to the lowering of the threshold! This is illustrated 
by way of example in figures 1 2 and 1 3 for the example of figure 10. 

. 10 Figure 12 shows the threshold T, which has been calculated in step 11 04. With ' 
the louvered threshold T the assignment procedure of figure 2 is restarted from 
the beginning whereby steps 200 and 202 do not need to be performed again, if 
the sorted object sequence has been stored when the procedure of figure 2 was 
canried out the first time. 

1.^ 

The resulting assignment of database tables to blades after the renewed per- 
formance of the procedure of figure 2 with the lowered threshold T is shown In 
figure 13. As apparent from the comparison of figures 10 and 13 the load is 
more evenly balanced between the blades after the renewed assignment pro- 
20 cedure. 

Figure 14 shows an alternative approach for refining the object size balancing. 
In step 1400 Delta 2 Is calculated by calculating the difference of the sum of the 
storage capacity of the blades and the sum of the object sizes of the objects to 
2.i be assigned to the blades and by dividing the difference by the number of 
blades. In step 1402 the threshold Is calculated by subtracting Delta 2 from the 
storage capacity. This threshold is the theoretical limit for the minimum storage 
capacities required on the individual blades in order to accommodate the ob- 
jects if it where possible to distribute the objects with finest granularity. 

. In step 1404 the method of figure 2 is performed again with the threshold as 
detemiined in step 1402 whereby the number N is fixed, i.e. for the last blade 
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Bn which is processed Die storage capacity will not be sufficient in most cases. 
In the resulting assignment of objects to blades, it is checked whether for the 
last blade, which has been processed, there is in fact an excess amount of 
memory requirement, which exceeds the storage capacity. 

5 

If this Is not the case, the assignment of objects to blades is outputted in step 
1408. If the opposite is the case, the excess amount of memory is divided by 
the number of blades N which provides Delta 3. In step 1412 the threshold is 
incremented by Delta 3 and the control goes back to step 1404. 

10 

Steps 1404, 1406, 1410 and 1412 are carried out repeatedly until there is no 
longer an excess amount of memory. 

Rgure 15 is based on the example of figure 10 and shows the threshold T as 
15 calculated in accordance with step 1402 of figure 14. In the example considered 
here, the difference between the sum of the storage capacities of the blades 
and the sum of the table sizes is 3 GB. The 3 GB are evenly distributed over the 
8 blades, which provides the threshold T. 

20 If there is no excess amount of memory as a result of one iteration but a gap 
between the aggregated size of objects, which have been assigned to the last 
blade N. the procedure is continued in order to reduce the gap. This can be 
done by dividing the gap by the number of blades N and distributing the result 
over the blades by increasing the threshold correspondingly. The gap is calcu- 

-25 lated-3sJoUowsUhi:eshokI-T--suna-of4he-5i2es-oM^ 

N. 

In this instance the process is stopped if (i) there is no significant change from 
one iteration to. the next (ii) the iterations toggle between different results, (iii) 
30 the standard deviation of the distribution of the objects does not improve or (iv) 
a maximum number of iterations has been reached. . 
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Figure 16 shows the result of the assignment procedure of figure 2. which has 
been performed with the threshold T as detenmined in step 1402. As a result of 
the assignment procedure there is an excess amount of memory E for blade Bs- 
In the example considered here the excess memory amount E is 858 MB. In 
5 accordance with step 1410 the excess amount E Is divided by the number of 
blades N = 8. In accordance with step 1412 the resulting amount of memory 
Delta 3 = 107 MB is added to the threshold. Next the assignment method of fig- 
ure 2 is carried out again with the Increased threshold, which provides the result 
as shown in figure 17. 

10 

Figure IB shows a further alterriative for refinement of the object size balancing.' 
First the step 1400 of the method of figure 14 is carried out In order to calculate 
Delta 2. Delta 2 is equivalent to the gap between the theoretical limit. I.e. the 
threshold as calculated in step 1402 of the method of figure 14, and the storage 
1 5 capacity of a blade. 

This gap is scanned by a stepwise variation of the threshold^ in order to identify 
an assignment of objects to blades which is balanced. The number of steps. I.e. 
the number of increments of the threshold, can be predefined or is user- 
20 selectable. 

In step 1800 Delta 2 is divided by the number of increments, which provides 
Delta 4. In step 1802 the threshold is calculated by dividing the sum of the ob- 
ject sizes by the number of blades N. Wilh this threshold the assignment 
25 method of figure 2 is performed again in step 1804. 

In step 1806 a statistical measure is calculated as a quality measure for the as- 
signment of objects to blades obtained as a result of step 1804. For example, 
the standard deviation of the aggregated sizes of objects assigned to each one 
30 of the blades Is calculated. 



"Ti[^ 08Ba39944eg 



I 08.08.8003 



14 

In other words, for each blade the total of the sizes of the objects, which have 
been assigned to the blade, is calculated. This provides one total size per 
blade. Next the standard deviation Is calculated for the total sizes. 

In step 1808 the threshold is incremented by Delta 4 and the control goes back 
to step 1804. This procedure is continued until the threshold has reached the 
storage capacity, i.e. the upper limit. 

In step 1810 one of the assignments obtained as a result of step 1804 is se- 
ni lected on the basis of the overall statistical measure. For example, the assign- ' 
ment having the lowest standard deviation is selected. 

Figure 19 illustrates this method with respect to the example shown In figure 10. 
The threshold T of 3712 MB is obtained by the calculation of step 1802. From 

15 there the threshold is stepwise increased in increments of Delta 4. which is 
Delta 2 = 384 MB divided my the number of increments. For example, the num- 
ber of Increments is 100. For each assignment procedure the standard devia- 
. tion of the table sizes assigned to blades Is calculated for selection of one of the 
assignments. Preferably the standard deviations are calculated only for those 

20 assignments which fit onto the minimum number of blades. 

Figure 20 shows a computer 108, which has processor 110 for mnning program 
112. Program 112 has module 114 for sorting of objects by size and module 
1 1 6 for assigning of objects to blades. 

25 , 

Further computer 108 has storage 1 18 for storing a table listing the objects and 
object sizes to be assigned to blades, storage 120 for storage of a storage ca- 
pacity value of the blades and storage 122 for storing pf the number of blades. 
Further computer 108 has interface 124 for coupling to workstation 126. 
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In operation the table with the object names/numbers and object sizes is en- 
tered via interface 124 and stored in storage 118. This corresponds to the in- 
formation shown in figure 3. 

5 Further a storage capacity value for the storage capacity of each individual 
blade is entered via interface 124 and stored in storage 120. In the example 
considered here, the storage capacity value is 4 GB. 

Next program 112 Is Invoked. Program 112 sorts the table of storage 118 by 
ID size to provide a sequence of objects (cf. figure 4). Next module 116 performs 
the method of figure 2 in order to determine the minimum number of required 
blades. This minimum number is stored in storage 122 and is outputted via user 
Interface 124. This number can be a basis for a users investment decision for 
purchasing the number of blades to realize a data processing system being ca- 
1 5 pable of handling the objects as listed in the table. 

In addition, module 116 can perform the methods of figure 11, figure 14 and/or 
figure 18 for refinement of the object size balancing. 

2U Alternatively, computer 108 is one of the blades. In this instance computer 108 
can dynamically change the assignment of objects to blades when the object 
' size changes. This way frequent swapping operations for swapping objects be- 
tween blades can be prevented. 
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List of Reference Numerals 



100 cluster 

102 processor 

104 memory 

106 network 

108 computer 

110 processor 

112 program 

114 module 

116 module 
118 • storage 

120 storage 

122 storage 

124 interface 

126 workstation 
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Claims 



1 . A method of assigning objects to processing units of a cluster of process- 
ing units, each one of the processing units having a storage capacity, the 
method comprising the steps of: 

a) sorting of the objects by size to provide a sequence of objects; 

b) for each processing unit of the cluster: 

- assigning of one ore more of the objects to the processing unit 
starting with the largest object In the sequence until a remaining 
storage capacity of the processing unit is below the smallest ot>- 
ject of the sequence; 



- deleting of the objects which are assigned to a processing unit 
frori) the sequence. 

2. The method of claim 1. whereby step 1 b) is carried out repeatedly until 
the sequence is empty which provides a minimum number of the process- 
ing units. 

3. The method of daim 1 or 2, whereby the remaining storage capacity is 
determined by the difference between the storage capacity and the aggre- 
gated size of objects being assigned to the processing unit. 

4. The method of claim 3» further comprising the steps of: 

c) determining a largest gap between the aggregated size of objects be- 
• Ing assigned to one of the processing units and the storage capacity, 
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d) subtracting the gap divided by the minimum number, of processing 
' units from the storage capacity to provide a first threshold, 

e) perfonning step 1 b) again, whereby the remaining storage capacity is 
the difference between the aggregated size of the objects being as- 



5. The method of any one of the preceding claims 1 to 4, further comprising 
•the steps of: 



c) detenmining the total of the sizes of the objects, 

d) determining the difference between the total of the storage capacities 
of the minimum number of processing units and the total of the sizes 



e) subtracting the difference divided by the minimum number of process- 
ing units from the storage capacity to provide a second threshold. 



f) performing step 1b) again, whereby the remaining storage capacity is 
.determined by the difference between the aggregated size of the ob- 
jects being assigned to the processing unit and the second threshold, 

— — g) — In c a se th a t a n a r es u lt of step 5f ) there Is an excess-amQimt-of-mem— 
ory requirement for one of the processing units which surpasses (he 
storage capacity, dividing the excess amount by the minimum number 
of processing units and increasing the second threshold by the result 
of the division. 



whereby steps 5 f) and 5 g) are performed repeatedly until there Is no 
such excess amount. 



signed to the processing units and the first threshold. 
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The method of claim 5, whereby if Ihere is no such excess amount but a 
gap, dividing the gap by the minimum number of processing units and de- 
creasing the second threshold by the result of the division. 

The method of any one of the proceeding claims 1 to 6, further comprising 
the steps of: 

c) stepwise varying a third threshold between, first and second limits. 



d) performing step 1 b) for each third threshold value, whereby the re- 
maining storage capacity is the difference between the aggregated 
size of the objects being assigned to the processing unit and the third 
threshold, and whereby a statistical measure Is calculated for the as- 

1 5 signment of objects to the processing unit. 

e) selecting of one of the assignments of objects to processing units 
based on the statistical measure. 

20 8. The method of claim 7, whereby the first limit is the aggregated size of the 
objects divided by the minimum number of processing units, and whereby 
(he second limit is the storage capacity. 

9. The method of claims 7 or 8, whereby the statistical measure calculated by 
2? calculation the standard deviation or the variance of the totals of the sizes 

of objects assigned to one processing unit. 

1 0. The method of any one of the proceeding claims 1 to 9. whereby the ob- 
jects are database tables of various sizes. 



1 1 . The method of any one of the preceding claims 1 to 1 0, whereby each one 
of the processing units is a blade or a blade server. 



'OQd D noo. 
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12. A computer program product for assigning objects to processing units of a 
cluster of processing units, each one of tifie processing units having a stor- 
age capacity, the computer program product comprising program means 
for performing the steps of: 

a) sorting of the objects by size to provide a sequence of objects; 

b) for each processing unit of the cluster. , 

- assigning of one ore more of the objects to the processing unit 
starting with the largest object in the sequence until a remaining 
.storage capacity of the processing unit Is below the smallest ob- 
ject of the sequence; 

- deleting of the objects which are assigned to a processing unit 
from the sequence. 

1 3. The computer program product of claim 12, the program means being 

20 adapted to repeatedly canry out step 12 b) until the sequence is empty and 

to output a minimum number of the processing units, which are required 
for the objects. 

14. The computer program product of claims 12 or 13, whereby the remaining 
"25 storage capacity is determined by the difference between the storage ca- 

pacity and the aggregated size of objects being assigned to the process- 
ing unit 

1 5. The computer program product of claims 12, 13 or 14, the program 
30 means being adapted to perform the steps of : 



15 
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c) determining a largest gap between the aggregated size of objects be- 
ing assigned to one of the processing units and the storage capacity. 

d) subtracting the gap divided by the minimum number of processing 
units from the storage capacity to provide a first threshold, 

e) performing step 12 b) again, whereby the remaining storage capacity is 
the difference between the aggregated size of the objects being as- 
signed to the processing units and the first threshold. 

1 6. The computer program product of anyone of the proceeding claims 1 2 to 
1 5, the program means being adapted to perform the steps of: 

c) determining the total of the sizes of the objects, 

d) determining the difference between the total of the storage capacities 
of the minimum number of processing units and the total of the sizes of 
the objects, 

20 e) subtracting the difference divided by the minimum number of process- 

ing units from the storage capacity to provide a second threshold, 

f) performing step 12b) again, whereby the remaining storage capacity is 
determined by the difference between the aggregated size of the ob- 

25 jects being assigned to the processing unit and the second threshold, 

g) in case that as a result of step 16f) there is an excess amount of mem- 
ory requirement for one of the processing units which surpasses the 
storage capacity, dividing the excess amount by the minimum number 

31) of processing units and increasing the second threshold by the result of 

the division. 
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whereby steps 16 f) and 16 g) are performed repeatedly until there is no 
such excess amount. 

17. The computer program product of anyone of the proceeding claims 12 to 
5 ' 18, the program means being adapted to perform the steps of: 

c) stepwise varying a third threshold betvveen first and second limits, 

d) performing step 12 b) for each third threshold value, whereby the re-' 
n> maining storage capacity is the difference between the aggregated size 

of the objects being assigned to the processing unit and the third 
threshold, and whereby a statistical measure is calculated for the as- 
signment of objects to the processing unit, 

1 5 e) selecting of one of the assignments of objects to processing units 

based on the statistical measure. 

18. A data processing system for determining a minimum number of process- 
ing units of a cluster of processing units for a given number of objects hav- 

2U ing various sizes, the data processing system comprising: 

means (114) for sorting of the objects by size to provide a sequence 
of objects, 

21 : means (116) for assigning of one or more of tfie objeRte m a prnnooe. 

ing unit starting with the largest object in the sequence until a remain- 
ing storage capacity of the processing unit is below the smallest object 
of the sequence, 

•^f' - means (122, 126) for outputling of the minimum number of the proc- 

essing units. 
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19, A blade server having object size balancing means (110. 1 12) for assign- 
ing objects to a piurality of blade sefverSt the object size balancing means 
being adapted to assign objects to the blade servers by the steps of: 

5 a) sorting of the objects by size to provide a sequence of objects; 

b) for each processing unit of the cluster 

- assigning of one ore more of the objects to the processing unit start- 
.10 ing with the largest object in the sequence until a remaining storage 

capacity of the processing unit is below the smallest object of the 
sequence; 

- deleting of the objects which are assigned to a processing unit from 
15 the sequence. 
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Abstract 

A method of assigning objects to processing units 



The present invention relates to a metliod of assigning of objects to processing 
units of a cluster of processing units. First the objects to be assigned are sorted 
by size, which provides a sequence of objects. Starting with the first processing 
unit objects are assigned in sequential order. This way the loading of the proc- 
M) essing units is balanced. This invention is particularly advantageous foe as- 
signment of data base tables to blade servers. 

(Rgure 2) 
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